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Method 

I hereby explain the method with help of an example experiment vised for cal- 
ibration. In the example experiment the test system was used to assess cyto- 
sine/thymine base ratio measurement methods as used in most methylation de- 
tection protocols using bisulphite treatment of the DNA. 

• First identical regions with local differences are subcloned into plasmids. In 
the experiment this was an inhomogeneous PGR product from incomplete 
bisulphite treated DNA that resulted in a mixture of molecules with different 
C/T proportions at all positions that were cytosine prior to conversion with 
bisulphite. 

• A set of the subclones is sequenced to obtain information about the base 
composition differences. Other methods to determine these differences can 
be used but sequencing of the subclones is the most appropriate method. 
In the experiment we sequenced 96 subclones from one inhomogeneous 
amplificate. 

• A set of subclones is chosen, that compared to each other are different at 
as many positions as possible relevant for the measurement method to be 
assessed. For this method a nimiber of two chosen subclones is the mini- 
mum but three or more lead to a higher resolution. In the experiment we 
chose three clones which differed at positions that in the genomic sequence 
were c5^osine and resulted in either cytosine or thymine dependent on the 
bisulphite conversion (see Fig. 0.1). 

• Mixable amounts of the chosen plasniids are gained by cultivation of the 
subclones and plasmid preparations. The gained plasmid stocks are equili- 
brated to equal concentrations before mixing. 

• The plasmid stocks are mixed in unequal proportions. To gain more test 
mixtures from the same source the proportions are permuted. Though this 
is possible with many proportions we suggest to use proportions based on 2" 
; « € [0, \,2...{cloneNumber— 1)]. In the experiment we mixed the clones 
in the proportions 1:2:4, which resulted in eight equally distributed base 
compositions from 0/7 to 7/7 in steps of 1/7. Permuting the proportions 
allowed to generate six different mixtures^ from the three clones which in 
this experiment covered many measurement points at different levels (see 

' proportion permutations for six different mixtures of tiiree clones: (1:2:4), (2:1:4), (1:4:2), 
(2:4:1), (4:1:2), (4:2:1) 
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Fig. 0.2). A choice of four clones might be used for up to 24 mixtures with 
permutations of the proportions 1:2:4:8 leading to 16 base compositions 
from 0 to 1 in 1/15 steps. 

• The mixtures (in the experiment six) can now be used to assess or calibrate 
methods which measure the base proportions at specific positions. Results 
from th emethod to be calibrated or assessed can be compared with expec- 
tation values based on known proportions in the test system. An example 
for this is given in Fig 0.2 h. 

Use and advantages 
Reproducibility 

Once a test system like the one described is established it can easily and cheaply 
be reproduced with low effort and low risk of changes. More complex systems 
needing more preparation steps (than concentration measurement and mixing), 
e.g. random PGR or enzymatic preparation steps, might not be as robust as the 
provided system and have a high variance from batch to batch. All these charac- 
teristics make test systems based on ihe described method a potential conunercial 
product: easy, reliably and cheap to produce as soon as established. 

Different proportions witliin one mixture 

Mixtures of e.g. methylated and unmethylated DNA only provide one defined 
base proportion to be expected after conversion, it is equal at each position. Any 
problems that might occur from the fact diat other positions have other rates are 
omitted from such system. The test system described here provides different pro- 
portions at different positions within one mixture. Therefore it overcomes the 
problem of the other system, wherein equal proportions at all positions are used 
and thereby might bias measurements. In addition this method allows to generate 
data over a range of measurement points and not only at one defined value, there- 
fore a single mixture can be used to assess the whole range of a measurement 
method. 

Specialized tests based on real DNA patterns 

The method allows to generate test systems providing any wanted composition 
of base proportion at different DNA positions whenever a needed pattern can be 
found in subclones derived from real samples. This allows to always choose the 
appropriate subclones for any analysis method the test system will be applied 
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to. It is e.g- possible to choose stretches that show blocks with equal base pro- 
portions at all sites of interest. This way the influence of such blocks (like local 
co-methylation) on measurement methods can be assessed. The fact that real sam- 
ple material can be used for the initial step of subclone generation allows to easily 
reproduce patterns as observed in nature. E.g. for methylation analysis this offers, 
the opportunity to test sensitive detection methods very precisely and in detail, 
and allows modeling reality in a more appropriate way than by mixing DNA of 
0% and 100% methylation at all positions. 

Single method step assessment 

The generation of mixtures of e.g. methylated and unmethylated DNA requires 
several steps until it can be used to assess a measurement method based on e.g. 
PCR products of bisulphite treated DNA. All these steps influence the real ex- 
pectation values and the results. 1. the production of methylated DNA may be 
incomplete and introduce errors. 2. the bisulphite conversion might be incom- 
plete 3. the amplification in the PGR might be biased or have a higji variance. All 
these steps add to any variance and/or bias in the final measurement method to 
be assessed and caimot easily be separated firom it. In contrast the here provided 
test system allows to asses measurement methods as a whole or its single steps. 
It therefore provides detailed infonnation about single steps and can locate error 
sources more easily than methods that provide only an assessment of a whole 
pipeline of steps. 
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ABSTRACT 



I here present a method that allows to assess and calibrate methods and systems 
that quantify base compositions at special positions in DNA. The method is char- 
acterized by using synthetic, highly reproducible test systems. Said test systems 
are characterized by a) being built by DNA subclone mixtures, b) providing high 
nimibers of measuring points within one DNA subclone mixture. 
The measuring points cover the range of the measurement method to be assessed 
or calibrated in an evenly distributed manner. 

If used to assess a DNA methylation detection method the test system is able 
to test te outcome of single steps of said method and therefore has a huge advan- 
tage compared to methods that can only assess the outcome of multiple steps. 

The method is characterized by the use of mixtures of subclones from one 
and the same DNA region that show base composition differences at positions of 
interest The method is further characterized by taking more than two subclones, 
that among one another are as unequal as possible and mix them in permutations 
of different portions. 

The test system allows to build models with patterns very close to observations in 
real DNA. An established system can easily be used as a standard for optimization 
and calibration experiments for different methods and is a potential commercial 
product. 
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Mutin 

Fig. 0.1 : Three final clones chosen for the mixtures, only genomic C positions and 
their on bisulphite treatment based equivalent (T) are shown. 
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a. selected clones 



b. U90 of tho systom In a real tost 
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Fig. 0.2: a. Number of measuring points for different C/(C+T) ratios within all six 
subclone mixtures of the example, b. real calibration data based on an 
assessment of base ratio detection with foiu: dye capillary sequencing. 
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APPENDIX: data from 96 subclones 




Fig. 0.3: Full sequence of clones from the initial sub-cloning step of G6e (part 1) 
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